Maximization of AUC and Buffered AUC in Classification

نویسندگان

  • Matthew Norton
  • Stan Uryasev
چکیده

This paper utilizes a new concept, called Buffered Probability of Exceedance (bPOE), to introduce an alternative to the Area Under the Receiver Operating Characteristic Curve (AUC) performance metric called Buffered AUC (bAUC). Central to the creation of bAUC is a new technique for calculation and optimization of bPOE. We show this formula to be easily integrable into optimization frameworks, often reducing bPOE minimization to convex, sometimes even linear, programming. Then, we utilize bPOE to create the bAUC performance metric, showing it to be an intuitive counterpart to AUC. In addition, we show that bAUC is much easier to handle in optimization frameworks than AUC, specifically reducing to convex and linear programming. We use these friendly optimization properties to introduce the bAUC Efficiency Frontier, a concept that serves to partially resolve the “incoherency” that arises when misclassification costs need be considered. We conclude that bAUC avoids many of the numerically troublesome issues encountered by AUC and integrates much more smoothly into the general framework of model selection and evaluation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Increasing the accuracy of the classification of diabetic patients in terms of functional limitation using linear and nonlinear combinations of biomarkers: Ramp AUC method

The Area under the ROC Curve (AUC) is a common index for evaluating the ability of the biomarkers for classification. In practice, a single biomarker has limited classification ability, so to improve the classification performance, we are interested in combining biomarkers linearly and nonlinearly. In this study, while introducing various types of loss functions, the Ramp AUC method and some of...

متن کامل

AUC Maximization with K-hyperplane

The area under the ROC curve (AUC) is a measure of interest in various machine learning and data mining applications. It has been widely used to evaluate classification performance on heavily imbalanced data. The kernelized AUC maximization machines have established a superior generalization ability compared to linear AUC machines because of their capability in modeling the complex nonlinear st...

متن کامل

Efficient AUC Maximization with Regularized Least-Squares

Area under the receiver operating characteristics curve (AUC) is a popular measure for evaluating the quality of binary classifiers, and intuitively, machine learning algorithms that maximize an approximation of AUC should have a good AUC performance when classifying new examples. However, designing such algorithms in the framework of kernel methods has proven to be challenging. In this paper, ...

متن کامل

Online AUC Maximization

Most studies of online learning measure the performance of a learner by classification accuracy, which is inappropriate for applications where the data are unevenly distributed among different classes. We address this limitation by developing online learning algorithm for maximizing Area Under the ROC curve (AUC), a metric that is widely used for measuring the classification performance for imb...

متن کامل

Stochastic Online AUC Maximization

Area under ROC (AUC) is a metric which is widely used for measuring the classification performance for imbalanced data. It is of theoretical and practical interest to develop online learning algorithms that maximizes AUC for large-scale data. A specific challenge in developing online AUC maximization algorithm is that the learning objective function is usually defined over a pair of training ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015